CONTROL METHOD AND SYSTEM FOR A REMOTE VIDEO CHAIN 

The invention pertains to a control method and device for a remote video chain. 
It applies in particular, but not exclusively, to videophone telecommunications. 
In conventional manner, videophone telecommunications use a video 
processing chain comprising a capture module, a coding module and a transmission 
5 module. 

In videophone telecommunications, only a few parameters can be modified by 
those participating in the communication to optimize the video signal they receive. 
These parameters can be divided into two categories. 

The first category of parameters groups together all those commands with 

10 which it is possible to remotely modify the behavior of the coding module of the 
video stream transmitter, by acting on its particular parameters. For example, protocol 
H.323 which applies to multimedia communication systems for IP networks (Internet 
Protocol) offers the possibility of sending a command to enable action upon the 
remote coder so as to increase the quality of images in a transmitted video stream to 

15 the detriment of fluidity, or conversely. Commands of this category are too restrictive 
and only offer the possibility of globally processing the transmitted video signal. 

The second category groups together remote piloting commands for a camera, 
these commands being defined in protocols which lay down standardized commands 
to be sent to a remote camera, so as to modify its field of filming by acting 

20 mechanically on its orientation and zoom. The application of these protocols in 
videophone telecommunications implies the use of compatible motorized cameras. 

The purpose of the present invention is to eliminate these disadvantages. This 
objective is achieved by providing a control method for a remote video processing 
chain producing a video stream to be transmitted to a terminal, with a view to acting 

25 upon the video stream, this method comprising steps during which the terminal 
transmits a command to the remote video processing chain in order to modify the 
video stream, and the remote video processing chain carries out the command on the 
video stream before transmitting it to the terminal. 

According to the invention, the video stream processing commands executable 

30 by the remote video processing chain comprise processing commands to be applied to 
an image region, which contain a definition of an image region of the image contained 
in the transmitted video stream, associated with a definition of a processing operation 
to be applied to the image region, execution by the remote video chain of a processing 
command to be applied to an image region comprising a selection step to select the 

35 image region in the video stream defined by the command, and a processing step to 
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apply the processing operation defined in the command to the image region selected 
in the video stream. 

According to one particular aspect of the invention, this process also comprises 
an activation step to activate an optional module in the video processing chain to carry 
5 out at least in part a command given by the terminal. 

According to another particular aspect of the invention, the video stream 
processing commands executable by the video processing chain comprise a zoom 
command associated with an image region to magnify the image region, a movement 
command to move the image region transmitted in the video stream subsequent to 
10 execution of the zoom command, and optionally a resolution increase command 
associated with an image region to increase the resolution of the image region to the 
detriment of the remainder of the image transmitted in the video stream. 

Advantageously, the video processing chain restores a default image 
configuration in the video stream after a preset time interval starting from the time of 
15 execution of a video stream processing command. 

Preferably, video stream processing commands executable by the video 
processing chain comprise a restore command for restoring a default image 
configuration of the images transmitted in the video stream. 

According to another particular aspect of the invention, the transmission of 
20 processing commands for transmitted video stream is conducted in accordance with 
protocol H.323 or SIP in messages which are ignored by the terminal if it is not 
compliant with them. 

The invention also concerns a system for controlling a first remote video 
processing chain producing a video stream which is transmitted to a first terminal via 
25 a telecommunications network, so as to act upon the transmitted video stream, this 
system comprising transmission means installed in the first terminal for transmitting 
to the first remote video processing chain processing commands to be applied to the 
video stream to be transmitted, and command receiving and execution means installed 
in the first remote video processing chain, for receiving and executing video stream 
30 processing commands on the video stream to be transmitted by the first processing 
chain. 

According to the invention, the video stream processing commands executable 
by the first remote video processing chain comprise processing commands to be 
applied to an image region, containing a definition of an image region of the image 
35 contained in the video stream, associated with a definition of a processing operation to 
be applied to the image region, the command receiving and execution means installed 
in the first remote video processing chain comprising selection means for selecting in 
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the video stream the image region defined in a processing command to be applied to 
said image region, and application means for applying the processing operation 
defined in the command to the image region selected in the video stream. 

According to one particular aspect of the invention this system comprises 
5 optional modules inserted in the first video processing chain to carry out at least in 
part the commands defining a processing operation to be applied to an image region in 
the transmitted video stream. 

Advantageously the optional modules comprise a refraining module. 
According to another particular aspect of the invention, the first video 
10 processing chain is installed in a second terminal comprising transmission means for 
transmitting commands to a second video processing chain installed in the first 
terminal to modify a video stream transmitted by the second video processing chain, 
the second video processing chain comprising command receiving and execution 
means for modifying the video stream it transmits. 
15 One preferred embodiment of the invention is described below by way of a 

non-restrictive example with reference to the appended drawings in which: 



Figure 1 schematizes an example of a videophone telecommunications system 
to which the method of the invention applies; 
20 Figure 2 gives a more detailed diagram of a video processing and command 

processing chain of the invention, integrated in a terminal; and 

Figure 3 shows the software organization of programs installed in a terminal. 



Figure 1 shows a videophone telecommunications system comprising two 
25 terminals 2, 3 between which a videophone telecommunication is set up. Each 
terminal is equipped with a display screen 5, command members such as a keyboard 4 
and mouse 6, a camera 7, a microphone 8 and at least one speaker 9. 

In the example shown in figure 1, terminal 2 is connected via a local network 
10 and a local server 1 1 to a public telecommunications network 1, while terminal 3 is 
30 connected directly to network 1 . 

During a videophone telecommunication, terminals 2, 3 via networks 1,10 and 
server 1 exchange video streams containing the images taken by cameras 7 and 
sounds captured by microphones 8. The exchanged video streams conform for 
example to protocol H.263+ and to QCIF image format (Quarter Common 
35 Intermediate), a frequently used video image format in videophone and 
videoconferencing communications. Evidently the invention may also apply to other 
image formats such as SQCIF, CIF, 4CIF, 16 CIF. 
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To generate said video stream, each terminal 2, 3 comprises a video processing 
chain such as the one shown in figure 2. This chain is piloted by a command 
processing chain 21. The video processing chain 20 comprises a video transmission 
part to transmit video images taken by the camera or derived from another source 19 
5 to another terminal, and a video receiving part to receive a video stream emitted by 
this other terminal and to display it on screen 5. The command processing chain 
comprises a command transmission part designed to transmit commands to the video 
processing chain of the other terminal, and a command receiving part designed to 
receive commands emitted by the other terminal and to pilot the transmission part of 
10 the video processing chain in relation to commands received. 

More precisely, the video transmission part comprises a capture module 22 
which receives the video images 21 taken by camera 7 of the terminal or derived from 
another source, and a coding module 24 designed to code the video stream provided 
by capture module 22, in particular to compress this stream so as to adapt it to the 
15 assigned transmission rate, and a transmission module 26 connected to a 
communication module of the terminal to place the coded video stream, generated by 
coding module 24, in the form of packets for its transmission. 

The video receiving part comprises a video receiving module 29 designed to 
reconstitute a video stream from the packets received by the terminal, a video 
20 decoding module 30 connected to the receiving module and designed to decode the 
video stream, and a display module 3 1 which is connected on one side to the decoding 
module and on the other side to the display screen 5 of the terminal to reconstruct the 
decoded video stream. 

According to the invention, optional modules 23, 25 are also inserted in the 
25 transmission part of the video processing chain 20 upstream and/or downstream from 
coding module 24, and/or integrated in the latter depending upon the configuration of 
terminal 2, 3. 

The receiving part of the command processing chain comprises a command 
receiving module 28 which delivers received commands to a control module 27, this 

30 module piloting the emitting part of the video processing chain, and in particular the 
capture module, optional modules 23, 25 and coding module 24 as well as the video 
source 19 when applicable. 

The transmission part of the command processing chain comprises a command 
module 32 which is connected to control members 4, 6 of the terminal, and which 

35 generates commands from the commands given by the user by means of terminal 
control members 4, 6, and a command transmission module 33 connected to command 
module 32 and which transmits commands generated by command module 32. 
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As shown in figure 3, the two processing chains (video and commands) 20, 21 
are in the form of an applicative software layer. The command processing layer 21 
uses a software layer 41 implementing protocol H.323 for example or SIP (Session 
Initiation Protocol) to transmit and receive commands from a remote terminal. If 
5 protocol H.323 is used, the commands are transmitted in the form of "Facility" 
messages enabling the transmission of owner data. If the SIP protocol is used, the 
commands may be transmitted using the "INFO" message. These two types of 
messages have the advantage of being ignored by the addressee terminal if it is not 
compliant therewith, so that it remains possible to maintain the interoperability of the 
10 terminals. 

The video processing layer 20 uses a software layer 40 implementing protocol 
RTP (Real Time Protocol) or analogue, to transmit and receive video streams in the 
form of packets. The two layers 40 and 41 are designed to be interfaced with a layer 
42 implementing the protocol TCP/IP (Transmission Control Protocol/Internet 

15 Protocol) grouping together the protocols used for the Internet network. Layer 42 can 
be used to physically access the network using transport layer 43. 

During a videophone telecommunication for example between terminals 2 and 
3, terminal 2 receives a video stream H263+ in QCIF format which is processed by 
the receiving part of the video processing chain 20. Through the invention the user 

20 can act upon the video stream received by means of keyboard 4 and/or mouse 6 by 
applying commands to the transmission part of the command processing chain 21. 
These commands are received and processed by the receiving part of the command 
processing chain 21 of the other terminal 3 and are used to pilot the transmission part 
of the video processing chain of terminal 3. 

25 The piloting commands which are generated by command module 32 are in the 

following format for example: 
Region<specification of an image region>, 
<Commands to be applied to the region>) 
Typically, an image region is defined by its shape and size, for example 

30 "RECTANGLE (Coordinates of upper left point>, Coordinates of lower right 
point>)". 

In addition to conventional commands with which to act on the quality of the 
transmitted images and therefore on transmission rate, these commands may be: 

35 - ZOOM to magnify the region specified in the command, 

- BETTERJtESOLUTION to obtain better resolution of the region specified 
in the command, 
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- GO to move the magnified region to another part of the basic image. 

If the user gives a zoom command, for example in the form 
"Region(RECTANGLE((80 5 80) ? (176 5 144)),ZOOM)" to magnify a defined region of 
5 the image, the control module 27 can apply several solutions to carry out this 
command. 

If the camera is motorized with zoom command, the control module 27 
transmits a command to be applied to the camera for best framing of the region 
indicated in the command. 

10 If the camera is not motorized, or if the command cannot be fully carried out by 

the camera (for example if the requested magnification exceeds the zoom capacity of 
the camera) the control module 27 applies to capture module 22 a format change 
command for incoming images so as to obtain greater resolution, for example 
requesting format CIF(3 52x288 pixels) instead of previously selected QCIF (176 x 

15 144 pixels). Control module 27 then activates optional refraining module 23 between 
the capture module 22 and the coding module so as only to select one part of the 
image captured and more precisely that part of the image contained in the region 
indicated in the received command. If the coding module 24 is compliant, the control 
module 27 may also modify its outgoing format so that it corresponds to that of the 

20 specified region, i.e. in the example of the above command, 96 x 64 pixels. 

Normally, to increase the resolution of a video stream, the transmission rate has 
to be increased since the data transmitted to the decoding module 30 is greater. To 
solve this problem the invention proposes the local increase in resolution of video 
images by assigning all or part of the transmission stream available to only one part of 

25 the image. The transmission part of the video processing chain will then concentrate 
its resources on the region requested by the remote user so as to offer the best quality 
possible by degrading or not coding those zones located outside the requested region. 

Therefore if reception module 28 receives a command for better resolution, for 
example: 

30 "Region (RECTANGLE ((80,86),(176,144)), BETTER JR£SOLUTION)", in order 
obtain better resolution in a defined region of the image, the control module 27 will 
act on the coding module 24 to allocate the maximum of resources (stream) to coding 
the region indicated in the command, to the detriment of the remainder of the image. 

If, in the received video images, part of the image is a still image, the user can 

35 therefore act on the remote video chain using this command so as no longer to receive 
this still part and to concentrate the resources of the video transmission chain on the 
desired part of the image. 
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To move the field of filming, the user can give a command in the following 
form for example: 

Region(RECTANGLE((- 1 76,-20),(0, 1 24)),GO). 

In this command, the coordinates of the requested image region are expressed 
5 in relation to the current image region. 

To process this command, there are several possibilities depending on whether 
or not the capture source can be moved. The capture source may be a motorized 
camera or a module for capturing a window in a still image (for example a computer 
display window). If the capture source can be moved, the command module acts 
10 directly on the source to endeavor to reach the region specified in the received 
command. If not, the control module 27 activates and commands an optional 
refraining module 23 and applies to it the region of the image to be transmitted. 

Provision may be made for those modifications applied to the transmission part 
of the video processing chain to have a temporary nature and to disappear after a 
1 5 preset time interval, the transmission part then reverting to a default configuration. 

Alternately, these modifications remain final until receipt of another command. 
Provision may be made in this case for a restore command to restore a default 
configuration. 

Evidently, commands other than those described above may be provided 
20 without departing from the scope of the invention. For example the remote terminal 
may be a server which produces a video image grouping together several video 
streams for videoconferencing. In this case, commands may be provided to act 
specifically on one of the video streams and on the presentation of the video streams 
in the image. 
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